Arabic Morphological Analyzer with Agglutinative Affix Morphemes and Fusional Concatenation Rules

Authors

  • Fadi A. Zaraket
  • Jad Makhlouta
Abstract

Current concatenative morphological analyzers consider prefix, suffix, and stem morphemes based on lexicons of morphemes and on morpheme concatenation rules that determine whether prefix-stem, stem-suffix, and prefix-suffix concatenations are allowed. Existing affix lexicons contain extensive redundancy, suffer from inconsistencies, and require significant manual work to augment with clitics and partial affixes if needed. Unlike traditional work, our method considers Arabic affixes as fusional and agglutinative, i.e., composed of one or more morphemes, introduces new compatibility rules for affix-affix concatenations, and refines the lexicons of the SAMA and BAMA analyzers to be smaller, less redundant, and more consistent. It also automatically and perfectly solves the correspondence problem between the segments of a word and the corresponding tags, e.g., part-of-speech and gloss tags.
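The rule types named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy lexicons, the category names, and the way the rules are combined (the last prefix morpheme is checked against the stem, the first suffix morpheme against the stem, and every prefix morpheme against every suffix morpheme) are assumptions made for illustration, and none of the entries are taken from BAMA or SAMA. Words are written in a Buckwalter-style transliteration.

```python
# Toy lexicons: surface form -> (category, gloss). All entries are hypothetical.
PREFIX_MORPHEMES = {"w": ("CONJ", "and"), "Al": ("DET", "the")}
STEMS = {"ktAb": ("NOUN", "book")}
SUFFIX_MORPHEMES = {"h": ("POSS_3MS", "his")}

# Affix-affix rules: category pairs that may appear next to each other.
PREFIX_PREFIX = {("CONJ", "DET")}
SUFFIX_SUFFIX = set()

# Classic prefix-stem, stem-suffix, and prefix-suffix compatibility rules.
PREFIX_STEM = {("CONJ", "NOUN"), ("DET", "NOUN")}
STEM_SUFFIX = {("NOUN", "POSS_3MS")}
PREFIX_SUFFIX = {("CONJ", "POSS_3MS")}  # DET + POSS_3MS is deliberately absent


def split_affix(text, lexicon, pair_rules):
    """Return every segmentation of `text` into lexicon morphemes whose
    adjacent categories are allowed by `pair_rules`."""
    if not text:
        return [[]]
    results = []
    for end in range(1, len(text) + 1):
        piece = text[:end]
        if piece not in lexicon:
            continue
        tag, gloss = lexicon[piece]
        for rest in split_affix(text[end:], lexicon, pair_rules):
            if rest and (tag, rest[0][1]) not in pair_rules:
                continue
            results.append([(piece, tag, gloss)] + rest)
    return results


def analyze(word):
    """Return all analyses of `word` as lists of (segment, tag, gloss) triples."""
    analyses = []
    for i in range(len(word) + 1):
        for j in range(i + 1, len(word) + 1):
            stem = word[i:j]
            if stem not in STEMS:
                continue
            stem_tag, stem_gloss = STEMS[stem]
            for pre in split_affix(word[:i], PREFIX_MORPHEMES, PREFIX_PREFIX):
                for suf in split_affix(word[j:], SUFFIX_MORPHEMES, SUFFIX_SUFFIX):
                    if pre and (pre[-1][1], stem_tag) not in PREFIX_STEM:
                        continue
                    if suf and (stem_tag, suf[0][1]) not in STEM_SUFFIX:
                        continue
                    if any((p[1], s[1]) not in PREFIX_SUFFIX
                           for p in pre for s in suf):
                        continue
                    analyses.append(pre + [(stem, stem_tag, stem_gloss)] + suf)
    return analyses


if __name__ == "__main__":
    for word in ("wAlktAb", "wktAbh", "AlktAbh"):
        print(word, analyze(word))
```

Because each affix is built from single morphemes that carry their own tag and gloss, every segment in an analysis is paired with its tags by construction, which is the kind of segment-to-tag correspondence the abstract refers to. In this toy setup, `wAlktAb` yields one analysis with three segments, while `AlktAbh` is rejected by the prefix-suffix rule.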

Similar resources

An Unsupervised Morpheme-Based HMM for Hebrew Morphological Disambiguation

Morphological disambiguation is the process of assigning one set of morphological features to each individual word in a text. When the word is ambiguous (there are several possible analyses for the word), a disambiguation procedure based on the word context must be applied. This paper deals with morphological disambiguation of the Hebrew language, which combines morphemes into a word in both ag...

An Affix Stripping Morphological Analyzer for Turkish

This paper presents the design and the implementation of a morphological analyzer for Turkish. A new methodology is proposed for doing the analysis of Turkish words with an affix stripping approach and without using any lexicon. The rule-based and agglutinative structure of the language allows Turkish to be modeled with finite state machines (FSMs). In contrast to the previous works, in this st...

Standard Arabic formalization and linguistic platform for its analysis

From the beginning of the sixties, and starting with the first automatic analyzer proposed by David Cohen, one of the first theorists of NLP [1], research has continued with natural language processing and especially the automatic treatment of the Arabic language. In 1983, with a minimalist morphological analysis, based on the theory that any Arabic form is generated using root and pattern, res...

Rule Based Morphological Analyzer of Kazakh Language

Having a morphological analyzer is a very critical issue especially for NLP related tasks on agglutinative languages. This paper presents a detailed computational analysis of Kazakh language which is an agglutinative language. With a detailed analysis of Kazakh language morphology, the formalization of rules over all morphotactics of Kazakh language is worked out and a rule-based morphological ...

A functional operator-based morphological analysis of Japanese

A universal set of functional operators as proposed in Role and Reference Grammars can be used to provide a robust morphology analyser development scheme, which gives the developer of the analyser a clear guiding principle guaranteeing the exhaustiveness of his grammar from the inception of the development task, freeing him from the complex bookkeeping of continuation lexicons often associated ...

Publication date: 2012